Abstract. A classical theorem of Fritz John allows one to describe a convex
body, up to constants, as an ellipsoid. In this article we establish similar
descriptions for generalized (i.e. multidimensional) arithmetic progressions in
terms of proper (i.e. collision-free) generalized arithmetic progressions, in both
torsion-free and torsion settings. We also obtain a similar characterization of
iterated sumsets in arbitrary abelian groups in terms of progressions, thus
strengthening and extending recent results of Szemerédi and Vu.



1. Introduction 

Define a convex body to be a compact convex subset of a Euclidean space $\mathbb{R}^d$ with
non-empty interior$^1$. We say that a convex body $B$ is symmetric if $-1 \cdot B = B$,
where $\lambda \cdot B := \{\lambda x : x \in B\}$ denotes the dilate of $B$ by $\lambda$. A classical theorem of John
[10] characterizes such bodies up to constants:

Theorem 1.1 (John's theorem, symmetric case). Let $B$ be a symmetric convex
body in $\mathbb{R}^d$. Then there exists a (closed) ellipsoid $E$ centered at the origin such that

    $E \subseteq B \subseteq \sqrt{d} \cdot E$.



The constant $\sqrt{d}$ here is sharp, as can be seen by considering the case when $B$ is a
box or cube. There is an analogous theorem for asymmetric convex bodies, but we 
will consider mainly symmetric situations here. 

The purpose of this paper is to investigate discrete analogues of John's theorem,
when $\mathbb{R}^d$ is replaced by a lattice $\Gamma$, a progression, an iterated sumset, or more
generally an arbitrary additive group $G$. For this, we shall need to replace the
concept of an ellipsoid by the notion of a (proper) generalized arithmetic progression
(GAP), which we now pause to define. Again we restrict attention to the symmetric
case.
Definition 1.2 (Sumset notation). An additive group is an abelian group $G = (G, +)$.
If $A, B$ are sets in an additive group $G$, we use $A + B := \{a + b : a \in A, b \in B\}$
for the sumset, $A - B := \{a - b : a \in A, b \in B\}$ for the difference set, and $|A|$ for
the cardinality of $A$. For $k \geq 1$, we use $kA = A + \ldots + A$ to denote the iterated
sumset of $k$ copies of $A$. If $n$ is an integer, we use $n \cdot A := \{na : a \in A\}$ to denote
the dilate of $A$ by $n$, where $na$ is the sum of $n$ copies of $a$ (with the conventions
$0a = 0$ and $(-n)a = -(na)$). We caution that $k \cdot A \neq kA$ in general, although we
do have $k \cdot A \subseteq kA$. If $I$ is a set of integers, we write $I \cdot A := \{na : n \in I, a \in A\}$. If
$a, b$ are reals, we use $[a, b]_{\mathbb{Z}} := \{n \in \mathbb{Z} : a \leq n \leq b\}$ to denote the discrete interval
and $[a, b]_{\mathbb{R}} := \{x \in \mathbb{R} : a \leq x \leq b\}$ to denote the continuous interval. Similarly
define $[a, b)_{\mathbb{Z}}$, $(a, b)_{\mathbb{R}}$, etc.
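
The notation above can be illustrated in the group $\mathbb{Z}$ with a minimal Python sketch. The function names and the sample set are chosen here for illustration only; the example shows the caution that the dilate $2 \cdot A$ is contained in, but in general smaller than, the iterated sumset $2A$.

```python
def sumset(A, B):
    """A + B = {a + b : a in A, b in B} for finite sets of integers."""
    return {a + b for a in A for b in B}

def iterated_sumset(A, k):
    """kA = A + ... + A (k copies), for k >= 1."""
    result = set(A)
    for _ in range(k - 1):
        result = sumset(result, A)
    return result

def dilate(A, n):
    """n . A = {n * a : a in A}."""
    return {n * a for a in A}

A = {0, 1, 5}                    # illustrative sample set
two_A = iterated_sumset(A, 2)    # {0, 1, 2, 5, 6, 10}
two_dot_A = dilate(A, 2)         # {0, 2, 10}
print(two_dot_A <= two_A, two_dot_A == two_A)  # prints: True False
```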

Definition 1.3 (GAPs). Let $G$ be an additive group. A symmetric generalized
arithmetic progression in $G$, or symmetric GAP for short, is a triplet $P = (N, v, d)$,
where the rank $\mathrm{rank}(P) = d$ is a non-negative integer, the dimensions
$N = (N_1, \ldots, N_d)$ are a $d$-tuple of positive reals, and the steps $v = (v_1, \ldots, v_d)$ are
a $d$-tuple of elements of $G$.

• We define the image $\mathrm{Image}(P) \subseteq G$ of $P$ to be the set

    $\mathrm{Image}(P) := [-N_1, N_1]_{\mathbb{Z}} \cdot v_1 + \ldots + [-N_d, N_d]_{\mathbb{Z}} \cdot v_d$
    $= \{ \sum_{i=1}^d n_i v_i : n_i \in [-N_i, N_i]_{\mathbb{Z}} \text{ for all } i = 1, \ldots, d \}$.

• For any $t > 0$, we define the dilate $P_t$ of $P$ to be the GAP $P_t := (tN, v, d)$
formed by dilating all the dimensions by $t$.

• We say that $P$ is proper if all the elements $n_1 v_1 + \ldots + n_d v_d$ for $n_i \in
[-N_i, N_i]_{\mathbb{Z}}$ are distinct. More generally, we say that $P$ is $t$-proper for some
$t > 0$ if $P_t$ is proper, and infinitely proper if it is $t$-proper for all $t$ (i.e. the
elements $v_1, \ldots, v_d$ are algebraically independent).

• We define the size of $P$ to be $\mathrm{size}(P) := |\mathrm{Image}(P)|$. Observe that $P$ is
proper if and only if $\mathrm{size}(P) = \prod_{i=1}^d (2 \lfloor N_i \rfloor + 1)$.
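
As a concrete illustration of Definition 1.3, the following Python sketch enumerates the image of a symmetric GAP in the group $G = \mathbb{Z}$ and tests properness via the size criterion just stated. The helper names (`gap_image`, `box_count`, `is_proper`, `dilate_gap`) and the sample progression are assumptions made for this example, not notation from the paper.

```python
import math
from itertools import product

def gap_image(N, v):
    """Image of the symmetric GAP (N, v, d) in Z.

    N: tuple of positive reals (dimensions); v: tuple of integer steps.
    The integers n with -N_i <= n <= N_i are -floor(N_i), ..., floor(N_i).
    """
    ranges = [range(-math.floor(Ni), math.floor(Ni) + 1) for Ni in N]
    return {sum(n * vi for n, vi in zip(ns, v)) for ns in product(*ranges)}

def box_count(N):
    """prod_i (2 * floor(N_i) + 1): the number of coefficient tuples."""
    return math.prod(2 * math.floor(Ni) + 1 for Ni in N)

def is_proper(N, v):
    """P is proper iff no two coefficient tuples give the same element."""
    return len(gap_image(N, v)) == box_count(N)

def dilate_gap(N, t):
    """Dimensions of the dilate P_t = (tN, v, d)."""
    return tuple(t * Ni for Ni in N)

N, v = (2, 2), (1, 5)                   # illustrative rank-2 GAP in Z
print(is_proper(N, v))                  # prints: True  (25 distinct sums)
print(is_proper(dilate_gap(N, 2), v))   # prints: False (4 + 0*5 == -1 + 1*5)
```

In this example the progression is proper but not 2-proper, which is the kind of behaviour the notion of $t$-properness is designed to track.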

Remark 1.4. For technical reasons, we need to allow the dimensions $N_i$ to be real
rather than integer. Because of this, it is not always the case that $\mathrm{Image}(P_t) +
\mathrm{Image}(P_{t'})$ is equal to $\mathrm{Image}(P_{t+t'})$, although this is true if the components of
$tN$ and $t'N$ have fractional parts in $[0, 1/2)_{\mathbb{R}}$. Instead, we only have the inclusion
$\mathrm{Image}(P_t) + \mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(P_{t+t'})$ in general. On the other hand, by replacing
each of the dimensions with their greatest integer part we can always assume that
the dimensions are integer without affecting the image, rank or properness of $P$
(although the image and properness of the dilates $P_t$ will be affected).
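
The role of the fractional parts can be seen in the simplest case, a rank-1 GAP in $\mathbb{Z}$ with step 1, whose image is a discrete interval. The sketch below is illustrative only (the helper names and the sample dimensions 1.5 and 1.25 are assumptions of this example): with dimension 1.5 the sumset inclusion is strict, while with dimension 1.25 it is an equality.

```python
import math

def interval(N):
    """[-N, N]_Z, i.e. the image of the rank-1 GAP ((N,), (1,), 1) in Z."""
    return set(range(-math.floor(N), math.floor(N) + 1))

def sumset(A, B):
    return {a + b for a in A for b in B}

def compare(N, t, s):
    """Return (Image(P_t) + Image(P_s), Image(P_{t+s})) for P = ((N,), (1,), 1)."""
    return sumset(interval(t * N), interval(s * N)), interval((t + s) * N)

lhs, rhs = compare(1.5, 1, 1)    # fractional part 1/2: only an inclusion
print(lhs < rhs)                 # prints: True  ([-2, 2] versus [-3, 3])
lhs, rhs = compare(1.25, 1, 1)   # fractional part in [0, 1/2): equality
print(lhs == rhs)                # prints: True  (both are [-2, 2])
```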

Remark 1.5. In most treatments, the progression $P$ is identified with its image
$\mathrm{Image}(P)$. However we shall avoid doing this here because many important
features of the progression (such as the rank, or the dilates $\mathrm{Image}(P_t)$) are not
completely determined by the image alone. In particular, if $P$ and $Q$ are symmetric
GAPs, an inclusion $\mathrm{Image}(P) \subseteq \mathrm{Image}(Q)$ does not necessarily entail an inclusion
$\mathrm{Image}(P_t) \subseteq \mathrm{Image}(Q_t)$ even when $t = 2$ (unless the fractional parts of the
dimensions all lie in $[0, 1/2)_{\mathbb{R}}$).
A lattice is a discrete additive subgroup of a Euclidean space. To start, we present
an analogue of John's theorem on lattices.

Theorem 1.6 (Discrete John's theorem). Let $B$ be a convex symmetric body in
$\mathbb{R}^d$, and let $\Gamma$ be a lattice in $\mathbb{R}^d$. Then there exists a symmetric, infinitely proper
GAP $P$ in $\Gamma$ with rank $\mathrm{rank}(P) \leq d$ such that we have the inclusions

    $(O(d)^{-3d/2} \cdot B) \cap \Gamma \subseteq \mathrm{Image}(P) \subseteq B \cap \Gamma \subseteq \mathrm{Image}(P_{O(d)^{3d/2}})$    (1)

and more generally

    $(O(d)^{-3d/2} t \cdot B) \cap \Gamma \subseteq \mathrm{Image}(P_t) \subseteq (t \cdot B) \cap \Gamma$    (2)

for any $t > 0$. Furthermore, we have the size bounds

    $O(d)^{-7d/2} |B \cap \Gamma| \leq \mathrm{size}(P) \leq |B \cap \Gamma|$.    (3)

As usual $O(X)$ denotes a quantity bounded above by $CX$ for some absolute constant
$C$; thus for instance $O(d)^{-d}$ denotes a quantity bounded from below by $(Cd)^{-d}$ for
some absolute constant $C$.

This theorem was essentially already established in [17, Lemma 3.36]. In an earlier
paper [1], it was proved (see Theorem 3 of [1]) that if $\Gamma$ is full dimensional then one
can find a GAP $P$ such that $B$ is contained in the convex hull $\mathrm{conv}(P)$ of $P$ and
$\mathrm{Vol}(\mathrm{conv}(P)) \leq C_d \mathrm{Vol}(B)$, for some constant $C_d$ depending only on $d$.

For the convenience of the reader (and because this theorem will be used to prove 
our other results) we supply a proof of Theorem 1.6 in Section 2, taking the oppor- 
tunity to strengthen the bounds slightly and correct some misprints. 

1.7. The torsion-free case. Next, we consider progressions in torsion-free additive
groups $G$ (thus $nx \neq 0$ for all $x \in G \setminus \{0\}$ and $n \in \mathbb{Z} \setminus \{0\}$); for instance any
lattice is torsion-free. The natural question here is whether one can compare a
non-proper progression $P$ with a proper or a $t$-proper progression. In this regard,
the following results are known:

Theorem 1.8 (Progressions contain proper progressions). [17, Theorem 3.38] Let
$P$ be a symmetric GAP of rank at most $d$ in an additive group $G$ (not necessarily
torsion free). Then there exists a proper symmetric GAP $Q$ with $\mathrm{rank}(Q) \leq \mathrm{rank}(P)$,
$\mathrm{Image}(Q) \subseteq \mathrm{Image}(P)$, and

    $O(d)^{-5d} \mathrm{size}(P) \leq \mathrm{size}(Q) \leq \mathrm{size}(P)$.

Theorem 1.9 (Progressions contained in proper progressions). [17, Theorem 3.40],
[8, Theorem 2.1] Let $P$ be a symmetric GAP of rank $d$ in a torsion-free group $G$, and let
$t \geq 1$. Then there exists a $t$-proper symmetric GAP $Q$ with $\mathrm{rank}(Q) \leq \mathrm{rank}(P)$,
$\mathrm{Image}(P) \subseteq \mathrm{Image}(Q)$, and

    $\mathrm{size}(P) \leq \mathrm{size}(Q) \leq (2t)^d d^{6d^2} \mathrm{size}(P)$.

Furthermore, if $P$ is not proper, we may take $\mathrm{rank}(Q) \leq \mathrm{rank}(P) - 1$.
We remark that similar results were also obtained earlier by Bilu [2] and Chang
[4]. The precise bound $(2t)^d d^{6d^2}$ is established in [8]; the argument in [17] only
considers the case $t = 1$ and gives the weaker bound of $d^{O(d^3)}$. We will be able to
improve Theorem 1.9 in Corollary 1.11 (and Corollary 1.18) below.

While these two results are already useful, they are not quite analogous to Theorem
1.1 or Theorem 1.6 because they only provide one-sided containments; the original
GAP $P$ either contains or is contained in a proper GAP $Q$, but the two proper
GAPs on either side of $P$ are not related to each other by a dilation.

Ideally we would like to take any GAP $P$ of rank $d$ and a parameter $t > 0$ and obtain
inclusions $\mathrm{Image}(Q_\varepsilon) \subseteq \mathrm{Image}(P) \subseteq \mathrm{Image}(Q)$ for some $t$-proper GAP $Q$ and some
$\varepsilon > 0$ depending in a reasonable manner on $d$ and $t$. However this is not feasible
once $t \geq 2$, even when $G$ is just the integers $\mathbb{Z}$. To see this, let $N$ be a large even
integer and consider the rank 2 progression $P := ((N/2, N), (1, N), 2)$. The image
of this progression fills out most of the discrete interval $(-N^2 + N/2, N^2 - N/2)_{\mathbb{Z}}$,
but misses the half-integer multiples of $N$. Suppose that we could find a 2-proper
progression $Q = (M, v, d)$ whose image contained $\mathrm{Image}(P)$. Thus we have a map

    $f : \mathrm{Image}(P) \to \prod_{i=1}^d [-M_i, M_i]_{\mathbb{Z}} \subseteq \mathbb{Z}^d$

such that $x = f(x) \cdot v$ for all $x \in \mathrm{Image}(P)$. Since $Q$ is 2-proper, we easily see that
$f$ is not only uniquely defined, but is also a Freiman homomorphism, thus

    $f(x_1) + f(x_2) = f(x_3) + f(x_4)$ whenever $x_1, x_2, x_3, x_4 \in \mathrm{Image}(P)$ and $x_1 + x_2 = x_3 + x_4$.

From this and the explicit form of $\mathrm{Image}(P)$ it is not hard to show that $f$ must now
be linear, thus $f(x) = x f(1)$. In particular, the discrete box $\prod_{i=1}^d [-M_i, M_i]_{\mathbb{Z}}$ must
contain the one-dimensional progression $[-N^2 + N/2 + 1, N^2 - N/2 - 1]_{\mathbb{Z}} \cdot f(1)$.
From this we see that $\mathrm{Image}(Q_\varepsilon)$ contains $[\varepsilon(-N^2 + N/2 + 1), \varepsilon(N^2 - N/2 - 1)]_{\mathbb{Z}}$,
and so for fixed $\varepsilon$ and large $N$ we cannot have $\mathrm{Image}(Q_\varepsilon) \subseteq \mathrm{Image}(P)$.

It is likely that similar examples can also be constructed to cover the case $t = 1$;
the GAP $P$ would thus be almost proper (with only a small number of collisions),
and its image would cover most of the image of a genuinely proper GAP $Q$, but
there would be enough "holes" in the image that $P$ could not accommodate a smaller
version $Q_\varepsilon$ of $Q$ for fixed $\varepsilon$.

Nevertheless, we can get around this problem by weakening the conclusion in a 
number of ways. One way is to no longer require the outer progression to be 
proper, but instead demand only that the inner progression is proper. This leads 
to our first main result: 

Theorem 1.10 (John's theorem for GAPs). Let $P$ be a symmetric GAP of rank
$d \geq 0$ in a torsion-free group $G$, and let $t \geq 1$. Then there exists a $t$-proper
symmetric GAP $Q$ of rank at most $d$, such that we have the inclusions

    $\mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(Q_{O(d)^{3d/2} t t'})$

for all $t' > 0$ and

    $\mathrm{Image}(Q_{tt'}) \subseteq \mathrm{Image}(P_{t'})$

for all $t' \geq 1$. In particular, we have

    $\mathrm{Image}(Q) \subseteq \mathrm{Image}(P) \subseteq \mathrm{Image}(Q_{O(d)^{3d/2} t})$.

Furthermore we have the size bound

    $t^{-d} 2^{-d^2 - O(d \log d)} \mathrm{size}(P) \leq \mathrm{size}(Q) \leq \mathrm{size}(P)$.

Finally, if $P$ is not 1/2-proper, then $Q$ can be chosen to have rank at most $d - 1$.

We prove this theorem (a fairly simple consequence of Theorem 1.6 and a rank 
reduction argument) in Section 4. Note that it implies an analogue of Theorem 1.8 
but with significantly weaker constants. 

As a corollary we can obtain the following improvement of Theorem 1.9. 

Corollary 1.11 (John's theorem for GAPs, again). Let $P$ be a symmetric GAP
of rank $d \geq 0$ in a torsion-free group $G$, and let $t \geq 1$. Then there exists a $t$-proper
symmetric GAP $Q$ of rank at most $d$, such that we have the inclusions

    $\mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(Q_{t'})$

for all $t' > 0$ and

    $\mathrm{Image}(Q_{tt'}) \subseteq \mathrm{Image}(P_{O(d)^{3d/2} t t'})$

for all $t' \geq 1$. In particular, we have

    $\mathrm{Image}(P) \subseteq \mathrm{Image}(Q) \subseteq \mathrm{Image}(P_{O(d)^{3d/2} t})$.

We also have the size bound

    $\mathrm{size}(P) \leq \mathrm{size}(Q) \leq t^d O(d)^{3d^2/2} \mathrm{size}(P)$.

Finally, if $P_{O(d)^{3d/2} t}$ is not proper, then $Q$ can be chosen to have rank at most $d - 1$.

Proof. Apply Theorem 1.10 with $P$ replaced by $P_{O(d)^{3d/2} t}$ for a sufficiently large
choice of the implied constant in $O(d)$. ■



1.12. The torsion case. Now we turn to the case where $G$ is allowed to contain
torsion (in particular, $G$ could be a finite group); equivalently, $G$ contains non-trivial
finite subgroups. Here, it is no longer reasonable to work with $t$-proper GAPs for
any $t \geq 2$. For instance, if $G$ is a non-trivial finite group, then (by the classification
of such groups) $G$ is the image of a proper GAP, but cannot be the image of a
$t$-proper GAP for any $t \geq 2$. Instead, as first observed by Green and Ruzsa [9], one
should replace GAPs by the more general notion of a coset progression:

Definition 1.13 (Coset progressions). Let $G$ be an additive group. A symmetric
coset progression in $G$ is a quadruplet $P = (N, v, d, H)$, where the rank $\mathrm{rank}(P) = d$
is a non-negative integer, the dimensions $N = (N_1, \ldots, N_d)$ are a $d$-tuple of positive
reals, the steps $v = (v_1, \ldots, v_d)$ are a $d$-tuple of elements of $G$, and the symmetry
group $H$ is a finite subgroup of $G$.

• We define the image $\mathrm{Image}(P) \subseteq G$ to be the set

    $\mathrm{Image}(P) := H + [-N_1, N_1]_{\mathbb{Z}} \cdot v_1 + \ldots + [-N_d, N_d]_{\mathbb{Z}} \cdot v_d$;

thus $\mathrm{Image}(P)$ is the sum of a subgroup and the image of a GAP.

• For any $t > 0$, we define the dilate $P_t$ of $P$ to be the coset progression
$P_t = (tN, v, d, H)$ formed by dilating all the dimensions by $t$ but keeping
the symmetry group fixed.

• We say that $P$ is proper if all the elements $h + n_1 v_1 + \ldots + n_d v_d$ for $n_i \in
[-N_i, N_i]_{\mathbb{Z}}$ and $h \in H$ are distinct. More generally, we say that $P$ is
$t$-proper for some $t > 0$ if $P_t$ is proper, and infinitely proper if it is $t$-proper
for all $t$ (i.e. the elements $v_1, \ldots, v_d$ are algebraically independent modulo
$H$).

• We define the size of $P$ to be $\mathrm{size}(P) := |\mathrm{Image}(P)|$. Observe that $P$ is
proper if and only if $\mathrm{size}(P) = |H| \prod_{i=1}^d (2 \lfloor N_i \rfloor + 1)$.

Remark 1.14. Of course, GAPs correspond to the special case H = {0}; also, finite 
subgroups of G are essentially coset progressions of rank 0. Note that H itself may 
require a large number of generators, but that this number has no bearing on the 
rank of P. 
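
To make Definition 1.13 concrete, here is a small Python sketch for the cyclic group $\mathbb{Z}/n\mathbb{Z}$, with group elements represented by residues. The function names, the modulus 30 and the subgroup $\{0, 10, 20\}$ are assumptions chosen for this example. It checks properness through the size criterion and confirms that a rank 0 coset progression has image $H$.

```python
import math
from itertools import product

def coset_progression_image(N, v, H, n):
    """Image of the coset progression (N, v, d, H) inside Z/nZ.

    N: dimensions (positive reals); v: steps (residues mod n);
    H: a set of residues assumed to form a subgroup of Z/nZ.
    """
    ranges = [range(-math.floor(Ni), math.floor(Ni) + 1) for Ni in N]
    return {(h + sum(c * vi for c, vi in zip(cs, v))) % n
            for h in H for cs in product(*ranges)}

def is_proper(N, v, H, n):
    """Proper iff size(P) = |H| * prod_i (2 * floor(N_i) + 1)."""
    expected = len(H) * math.prod(2 * math.floor(Ni) + 1 for Ni in N)
    return len(coset_progression_image(N, v, H, n)) == expected

n, H = 30, {0, 10, 20}                           # H is the subgroup of order 3
print(is_proper((1,), (1,), H, n))               # prints: True  (3 * 3 = 9 elements)
print(is_proper((5,), (1,), H, n))               # prints: False (33 > |Z/30Z|)
print(coset_progression_image((), (), H, n) == H)  # prints: True  (rank 0)
```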

Coset progressions are essential tools in the study of sum sets on arbitrary groups. 
We mention two key (and closely related) theorems from [9] in this regard: 

Theorem 1.15 (Chang's theorem in an arbitrary group). [9, Section 5] Let $A \subseteq G$
be a non-empty finite set such that $|2A| \leq K|A|$ for some $K \geq 2$. Then there
exists a proper coset progression $P$ in $G$ of rank $O(K^3 \log K)$ and size $\mathrm{size}(P) \geq
\exp(-O(K^3 \log^2 K))|A|$ such that $\mathrm{Image}(P) \subseteq 2A - 2A$.

Theorem 1.16 (Freiman's theorem in an arbitrary group). [9, Theorem 1.1] Let
$A \subseteq G$ be a non-empty finite set such that $|2A| \leq K|A|$ for some $K \geq 2$. Then
there exists a coset progression $P$ in $G$ of rank $O(K^4 \log K)$ and size at most
$\exp(O(K^4 \log^2 K))|A|$ such that $A$ is contained in a translate of $\mathrm{Image}(P)$.

One should compare these results to Theorems 1.8 and 1.9 respectively. Alternate 
proofs of these results (with slightly weaker constants) can also be found in [17, 
Theorem 5.48] and [17, Theorem 5.44] respectively. 

In Section 5 we establish the following generalization of Theorem 1.10: 

Theorem 1.17 (John's theorem for coset progressions). Let $P$ be a symmetric coset
progression of rank $d \geq 0$, and let $t \geq 1$. Then there exists a $t$-proper symmetric
coset progression $Q$ of rank at most $d$, such that we have the inclusions

    $\mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(Q_{O(d)^{3d/2} t t'})$

for all $t' > 0$ and

    $\mathrm{Image}(Q_{tt'}) \subseteq \mathrm{Image}(P_{t'})$

for all $t' \geq 1$. In particular, we have

    $\mathrm{Image}(Q) \subseteq \mathrm{Image}(P) \subseteq \mathrm{Image}(Q_{O(d)^{3d/2} t})$.

Furthermore we have the size bound

    $t^{-d} 2^{-d^2 - O(d \log d)} \mathrm{size}(P) \leq \mathrm{size}(Q) \leq \mathrm{size}(P)$.

Finally, the symmetry group of $Q$ contains that of $P$, and if $P$ is not 1/2-proper,
then $Q$ can be chosen to have rank at most $d - 1$.

Note that when P lies in a torsion-free group, the symmetry group must be trivial, 
and so Theorem 1.17 contains Theorem 1.10 as a special case. 

By repeating the proof of Corollary 1.11 we obtain

Corollary 1.18 (John's theorem for coset progressions, again). Let $P$ be a symmetric
coset progression of rank $d \geq 0$, and let $t \geq 1$. Then there exists a $t$-proper
symmetric coset progression $Q$ of rank at most $d$, such that we have the inclusions

    $\mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(Q_{t'})$

for all $t' > 0$ and

    $\mathrm{Image}(Q_{tt'}) \subseteq \mathrm{Image}(P_{O(d)^{3d/2} t t'})$

for all $t' \geq 1$. In particular, we have

    $\mathrm{Image}(P) \subseteq \mathrm{Image}(Q) \subseteq \mathrm{Image}(P_{O(d)^{3d/2} t})$.

We also have the size bound

    $\mathrm{size}(P) \leq \mathrm{size}(Q) \leq t^d O(d)^{3d^2/2} \mathrm{size}(P)$.

Finally, if $P_{O(d)^{3d/2} t}$ is not proper, then $Q$ can be chosen to have rank at most $d - 1$.

1.19. Sumsets in groups. Now we consider the question of establishing John-type
theorems for iterated sumsets $lA$ for $l$ large. We will be interested here in
the "additively structured" case when there are plenty of additive relations between
elements of $A$, either because $A$ is contained in a structured set such as a
progression, or because the iterated sumsets $lA$ are fairly small. For instance, we have the
following recent result of Szemerédi and Vu.

Theorem 1.20 (Sumsets in integers). [16] For every integer $d \geq 1$ there exist
$C, \varepsilon > 0$ such that the following statement holds: whenever $N \geq 1$, $l \geq 1$ and
$A \subseteq [1, N]_{\mathbb{Z}}$ are such that $l^d |A| \geq CN$, then there exists a proper symmetric GAP
$Q$ of rank $1 \leq d' \leq d$ and size at least $\varepsilon l^{d'} |A|$ such that $lA$ contains a translate of
$\mathrm{Image}(Q)$.

For $d = 1$, much more precise results in this direction are known: see [15], [5], [14],
[11]. For further discussion of this result, including the sharpness of the various
bounds, we refer the reader to [16] and [17, §12]. An alternate proof of this result
can be found in [17, Theorem 12.4]. For variants of this theorem when $l = 2, 3, 4$ is
small, see [3], [6], [7], and Theorem 1.15 above.

Using the above machinery, together with the Freiman-type theorems of Green and
Ruzsa, we can now generalize this statement to arbitrary additive groups with more
explicit bounds and a slightly stronger statement (giving both an upper and lower
containment for $lA$).
Theorem 1.21 (John's theorem for iterated sumsets). There exists a positive
integer $C_1$ such that the following statement holds: whenever $d \geq 1$, $l \geq 1$ and
$A \subseteq G$ is a non-empty finite set such that $l^d |A| \geq 2^{2^{C_1 d^2 2^{6d}}} |lA|$, then there
exists a proper symmetric coset progression $Q$ of rank $0 \leq d' \leq d - 1$ and size
$\mathrm{size}(Q) \geq 2^{-2^{C_1 d^2 2^{6d}}} l^{d'} |A|$, and $x, x' \in G$, such that

    $x + \mathrm{Image}(Q) \subseteq lA \subseteq x' + 2^{2^{C_1 d^2 2^{6d}}} \mathrm{Image}(Q)$.

Remark 1.22. The triple exponential dependence on $d$ is somewhat unsatisfactory; a
single exponential would be more natural. One exponential arises from the current
best known bounds in Freiman-type theorems (see Theorem 1.15). Another arises
from the need to ensure that a sumset of $\mathrm{Image}(Q)$ can cover a sumset of $A$, which
can temporarily exponentiate the rank of the progression. It may be that one
of these exponential losses can be removed, or that two of them can be run "in
parallel", reducing the total loss to a double exponential, but we will not attempt
to do so here. In the asymptotic limit $l \to \infty$, much more about the structure of $lA$
is known; for instance $|lA|$ is eventually a polynomial in $l$ [12], [13]. The behavior
of $lA$ for large $l$ is also closely connected to Theorems 1.15, 1.16; see [9] for further
discussion.

Remark 1.23. When $A$ is symmetric, thus $A = -A$, and one has $0 \in A$ (or $l$ is
even), then one can take $x = x' = 0$ by exploiting the identity $2lA = lA - lA$ and
the inclusions $lA \subseteq l'A$ when $l' \geq l$ and $0 \in A$.

We prove this theorem in Section 7. Note that Theorem 1.20 emerges as a special
case since in that case we have $|lA| \leq |l[1, N]_{\mathbb{Z}}| \leq lN$. The $d = 1$ case of this
theorem already implies

Corollary 1.24 (Sárközy's theorem in an arbitrary group). There exists an absolute
constant $C_2 > 0$ such that the following statement holds: whenever $l \geq 1$ and
$A \subseteq G$ is a subset of a finite additive group such that $l|A| \geq C_2 |G|$, then $lA$ is a
coset of the subgroup generated by $A - A$.

Proof. Applying Theorem 1.21 with $d = 1$ we see that (for $C_2$ large enough) we
obtain a coset progression $Q$ of rank 0 with $\mathrm{size}(Q) \geq \varepsilon l^{d'} |A|$ for some absolute
constant $\varepsilon > 0$, with $lA$ containing a translate of $\mathrm{Image}(Q)$ and being contained in
a translate of $C\,\mathrm{Image}(Q)$ for some absolute constant $C$. Since $Q$ has rank 0, $\mathrm{Image}(Q)$ is a
subgroup and it is easy to check that it is generated by $A - A$. The claim follows. ■
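
The conclusion of Corollary 1.24 can be observed numerically in a small cyclic group. The sketch below is only an illustration under assumptions of our own: the group $\mathbb{Z}/12\mathbb{Z}$, the set $A = \{1, 3, 5\}$ and the helper names are chosen for this example, and nothing here determines the constant $C_2$. It checks that $lA$ is a coset of the subgroup generated by $A - A$ for $l = 5$, but not for $l = 1$.

```python
from functools import reduce
from math import gcd

def iterated_sumset_mod(A, l, n):
    """lA inside Z/nZ, for l >= 1."""
    result = {a % n for a in A}
    for _ in range(l - 1):
        result = {(s + a) % n for s in result for a in A}
    return result

def subgroup_generated_by_differences(A, n):
    """Subgroup of Z/nZ generated by A - A: the multiples of a gcd."""
    g = reduce(gcd, [(a - b) % n for a in A for b in A], n)
    return set(range(0, n, g))

def is_coset(S, H, n):
    """True iff the non-empty set S equals x0 + H for some (any) x0 in S."""
    x0 = next(iter(S))
    return S == {(x0 + h) % n for h in H}

n, A = 12, {1, 3, 5}
H = subgroup_generated_by_differences(A, n)       # the even residues mod 12
print(is_coset(iterated_sumset_mod(A, 5, n), H, n))  # prints: True
print(is_coset(iterated_sumset_mod(A, 1, n), H, n))  # prints: False
```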



2. The discrete John's theorem 

We now prove Theorem 1.6, following the arguments in [17, Section 3.5]. We may
assume that $\Gamma$ has full rank (i.e. its linear span is all of $\mathbb{R}^d$), since otherwise we
can restrict to the linear span of $\Gamma$.
Applying John's theorem and a linear transformation we may assume that

    $B_d \subseteq B \subseteq \sqrt{d} \cdot B_d$    (4)

where $B_d$ is the (closed) unit ball in $\mathbb{R}^d$. We recall the standard formula

    $\mathrm{mes}(B_d) = \pi^{d/2} / (d/2)!$    (5)

where mes denotes $d$-dimensional Lebesgue measure and $(d/2)!$ is shorthand for Euler's
gamma function evaluated at $d/2 + 1$. In particular

    $\mathrm{mes}(B) \leq \sqrt{d}^{\,d}\, \mathrm{mes}(B_d) \leq O(1)^d$.    (6)

By the theory of Mahler bases (specifically, see [2] or [17, Corollary 3.35]) we may
find linearly independent $v_1, \ldots, v_d$ which generate $\Gamma$, and such that

    $\mathrm{mes}(\mathbb{R}^d / \Gamma) = |v_1 \wedge \ldots \wedge v_d| \geq O(d)^{-3d/2} |v_1| \ldots |v_d|$.    (7)

Indeed, the argument given in [17] allows one to write the $O(d)^{-3d/2}$ factor more
explicitly as ^i^. 

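A volume packing argument (see [17, Lemma 3.26]) gives

    $|B \cap \Gamma| \leq \frac{3^d d!\, \mathrm{mes}(B)}{2^d\, \mathrm{mes}(\mathbb{R}^d / \Gamma)} \leq \frac{O(d)^{5d/2}}{|v_1| \ldots |v_d|}$    (8)

where we have used (6) and (7).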

Let us now set $P$ to be the symmetric GAP $P = (N, v, d)$, where $v := (v_1, \ldots, v_d)$,
$N := (N_1, \ldots, N_d)$, and $N_i := \frac{1}{d|v_i|}$. Since $v_1, \ldots, v_d$ are linearly independent, we
see that $P$ is infinitely proper. Observe that

    $\mathrm{size}(P) \geq \prod_{i=1}^d N_i = \frac{1}{d^d |v_1| \ldots |v_d|}$

and so the first inequality in (3) follows from (8). Next, observe that if $x \in
\mathrm{Image}(P_t)$ then

    $|x| \leq t N_1 |v_1| + \ldots + t N_d |v_d| = t$

and so $x \in t \cdot B_d \subseteq t \cdot B$. This gives the second containment in (2); setting $t = 1$ we
also obtain the second inequality in (3).

Now let $x \in \Gamma$. Since $v_1, \ldots, v_d$ generate $\Gamma$, we have $x = n_1 v_1 + \ldots + n_d v_d$ for some
integers $n_1, \ldots, n_d$. From Cramer's rule we have

    $|n_i| = \frac{|x \wedge v_1 \wedge \ldots \wedge v_{i-1} \wedge v_{i+1} \wedge \ldots \wedge v_d|}{|v_1 \wedge \ldots \wedge v_d|} \leq |x| \frac{|v_1| \ldots |v_d|}{|v_i| |v_1 \wedge \ldots \wedge v_d|}$.

Applying (7) and the definition of $N_i$ we conclude that

    $|n_i| \leq O(d)^{3d/2} |x| N_i$

and so

    $x \in \mathrm{Image}(P_{O(d)^{3d/2} |x|})$.

Using this and (4) we obtain the first inclusion in (2). Since (2) clearly implies (1),
we obtain Theorem 1.6. ■
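
The Cramer's rule step can be checked by hand in dimension $d = 2$, where the wedge products reduce to $2 \times 2$ determinants. The following Python sketch is illustrative only: the function names and the sample basis $v_1 = (2, 0)$, $v_2 = (1, 3)$ are assumptions of this example. It recovers the coefficients $n_1, n_2$ of a lattice point and compares them with the bound $|x| |v_1| |v_2| / (|v_i| |v_1 \wedge v_2|)$.

```python
import math

def det2(a, b):
    """The 2x2 determinant a ^ b of two plane vectors."""
    return a[0] * b[1] - a[1] * b[0]

def norm(a):
    return math.hypot(a[0], a[1])

def coefficients(x, v1, v2):
    """Cramer's rule: solve x = n1 * v1 + n2 * v2 for (n1, n2)."""
    D = det2(v1, v2)
    return det2(x, v2) / D, det2(v1, x) / D

def cramer_bound(x, v1, v2, i):
    """The bound |x| |v1| |v2| / (|v_i| |v1 ^ v2|) on |n_i|, with i in {0, 1}."""
    vi = (v1, v2)[i]
    return norm(x) * norm(v1) * norm(v2) / (norm(vi) * abs(det2(v1, v2)))

v1, v2 = (2, 0), (1, 3)
x = (4, -6)                      # x = 3 * v1 - 2 * v2
n1, n2 = coefficients(x, v1, v2)
print(n1, n2)                    # prints: 3.0 -2.0
print(abs(n1) <= cramer_bound(x, v1, v2, 0),
      abs(n2) <= cramer_bound(x, v1, v2, 1))  # prints: True True
```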





TERENCE TAO AND VAN VU 



3. Convex progressions 

It will be convenient to generalize from arithmetic progressions to the more
geometric concept of a convex progression, as these are more stable under operations
such as restriction or projection to a subspace. The idea of working with convex
progressions was inspired by [8]. In view of our eventual generalization to the
torsion case, we shall also allow the inclusion of a finite symmetry group, leading to
convex coset progressions.

Definition 3.1 (Convex progressions). A symmetric convex coset progression in
an additive group $G$ is a quintuplet $P = (B, \Gamma, d, \phi, H)$, where the rank $\mathrm{rank}(P) :=
d \geq 0$ is an integer, $B$ is a symmetric convex body in $\mathbb{R}^d$, $\Gamma$ is a lattice in $\mathbb{R}^d$, $\phi : \Gamma \to G$ is a
homomorphism, and $H$ is a finite subgroup of $G$. We define the image of $P$ as

    $\mathrm{Image}(P) := \phi(B \cap \Gamma) + H \subseteq G$

and the size of $P$ as $\mathrm{size}(P) := |\mathrm{Image}(P)|$. If $t > 0$, we define the dilates $P_t :=
(t \cdot B, \Gamma, d, \phi, H)$. We say that $P$ is proper if the map $(x, h) \mapsto \phi(x) + h$ is injective
on $(B \cap \Gamma) \times H$, $t$-proper if $P_t$ is proper, and infinitely proper if it is $t$-proper for
all $t$. If $H = \{0\}$ we refer to a symmetric convex coset progression as simply a
symmetric convex progression.

Observe that every symmetric GAP $P = (N, v, d)$ is also a symmetric convex
progression of the same rank and size, setting$^2$ $B := \prod_{i=1}^d [-N_i, N_i]_{\mathbb{R}}$, $\Gamma := \mathbb{Z}^d$ and
$\phi(n_1, \ldots, n_d) := n_1 v_1 + \ldots + n_d v_d$ for all $(n_1, \ldots, n_d) \in \mathbb{Z}^d$, and that the notions
of dilation and properness are consistent. Thus we can view convex progressions as
generalizations of GAPs. Similarly convex coset progressions are generalizations of
coset progressions.

We observe the useful sumset embedding

    $\mathrm{Image}(P_t) + \mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(P_{t+t'})$

and hence

    $k\,\mathrm{Image}(P_t) \subseteq \mathrm{Image}(P_{kt})$

for $t, t' > 0$ and $k = 1, 2, \ldots$, and all symmetric convex coset progressions $P$. We
shall use these embeddings frequently in the sequel without further comment.

Remark 3.2. One way to view these embeddings is that they induce a translation-invariant
pseudo-metric $\mathrm{dist}_P : G \times G \to [0, +\infty]$ on $G$, defined by setting

    $\mathrm{dist}_P(x, y) := \inf\{t > 0 : x - y \in \mathrm{Image}(P_t)\}$

with $\mathrm{dist}_P(x, y) = +\infty$ if no such $t$ exists (i.e. if $x - y$ is not in the group generated
by $H$ and the $v_1, \ldots, v_d$). We will however not use this metric structure explicitly.

We now give two basic facts about these progressions. 



$^2$ It is here that we are using our choice that convex bodies are compact rather than bounded
open. One could work with bounded open bodies by redefining GAPs using $(-N_i, N_i)_{\mathbb{Z}}$ instead
of $[-N_i, N_i]_{\mathbb{Z}}$, but this creates some (minor) technical problems regarding dilations.
Lemma 3.3 (Doubling lemma). If $P$ is a convex coset progression of rank $d$ and
$t \geq 1$, then $\mathrm{Image}(P_t)$ can be covered by at most $(4t + 1)^d$ translates of $\mathrm{Image}(P)$.
In particular

    $\mathrm{size}(P) \leq \mathrm{size}(P_t) \leq (4t + 1)^d \mathrm{size}(P)$.

Proof. It suffices to establish the first claim. By applying the quotient map $G \to
G/H$ we can reduce to the case $H = \{0\}$. In this case the claim follows from [17,
Lemma 3.21]. Here we use compact convex bodies rather than open bounded ones,
but one can pass from one to the other by dilating by an epsilon and then sending
that epsilon to zero; we omit the details. ■

Remark 3.4. In many cases one can lower the $(4t + 1)^d$ factor to be closer to $t^d$, for
instance if $t$ is an integer and $P$ is a GAP with integer dimensions, but this type
of improvement will not significantly improve our results here.
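
A quick numerical sanity check of the size bound in Lemma 3.3 for a GAP in $\mathbb{Z}$ can be sketched as follows. The helper name `gap_image` and the sample progression with dimensions $(2, 3)$ and steps $(1, 7)$ are assumptions made for illustration; in this instance the growth factor is far below $(4t + 1)^d$, in line with Remark 3.4.

```python
import math
from itertools import product

def gap_image(N, v):
    """Image of the symmetric GAP (N, v, d) in Z (N: dimensions, v: steps)."""
    ranges = [range(-math.floor(Ni), math.floor(Ni) + 1) for Ni in N]
    return {sum(n * vi for n, vi in zip(ns, v)) for ns in product(*ranges)}

N, v, t = (2, 3), (1, 7), 2
d = len(N)
size_P = len(gap_image(N, v))                           # 35 (P is proper)
size_Pt = len(gap_image(tuple(t * Ni for Ni in N), v))  # 93 (all of [-46, 46])
bound = (4 * t + 1) ** d * size_P                       # 81 * 35 = 2835
print(size_P <= size_Pt <= bound)                       # prints: True
```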

Lemma 3.5 (Covering lemma). Let $P, Q$ be convex coset progressions of rank $d$
such that $\mathrm{Image}(Q) \subseteq \mathrm{Image}(P_t)$ for some $t > 0$. Then for any $t' > 0$, $\mathrm{Image}(P_{t'})$
can be covered by at most $O(t + t' + 1)^d\, \mathrm{size}(P) / \mathrm{size}(Q)$ translates of $\mathrm{Image}(Q)$.

Proof. Applying Ruzsa's covering lemma (see e.g. [17, Lemma 2.14]) we conclude
that $\mathrm{Image}(P_{t'})$ can be covered by $|\mathrm{Image}(P_{t'}) + \mathrm{Image}(Q)| / |\mathrm{Image}(Q)|$ translates
of $\mathrm{Image}(Q) - \mathrm{Image}(Q)$. But by Lemma 3.3

    $|\mathrm{Image}(P_{t'}) + \mathrm{Image}(Q)| \leq |\mathrm{Image}(P_{t'}) + \mathrm{Image}(P_t)|$
    $\leq |\mathrm{Image}(P_{t'+t})|$
    $= \mathrm{size}(P_{t'+t})$
    $\leq O(t' + t + 1)^d\, \mathrm{size}(P)$

while $\mathrm{Image}(Q) - \mathrm{Image}(Q) \subseteq \mathrm{Image}(Q_2)$ can be covered by $O(1)^d$ translates of
$\mathrm{Image}(Q)$. The claim follows. ■



4. John's theorem in torsion-free groups 

The purpose of this section is to prove Theorem 1.10. The arguments here will
be superseded by those in Section 5, but we present these arguments first in the
simpler torsion-free setting for expository purposes.

The key ingredient in the proof is the following rank reduction dichotomy in the 
convex progression setting. 

Lemma 4.1 (Lack of properness implies rank reduction). Let $P$ be a symmetric
convex progression of rank $d$ in a torsion-free group $G$, and suppose that $P_{1/2}$ is not
proper. Then there exists a symmetric convex progression $Q$ of rank $d - 1$ with the
inclusions

    $\mathrm{Image}(P_t) \subseteq \mathrm{Image}(Q_{2t})$    (9)

for all $t > 0$ and

    $\mathrm{Image}(Q_t) \subseteq \mathrm{Image}(P_t)$    (10)

for all $t \geq 1$.

Proof. Write $P = (B, \Gamma, d, \phi, \{0\})$. Since $P_{1/2}$ is not proper, there exist distinct
$x, x' \in (\frac{1}{2} \cdot B) \cap \Gamma$ such that $\phi(x) = \phi(x')$. Setting $y := x - x'$, we thus have
$y \in B \cap \Gamma \setminus \{0\}$ and $\phi(y) = 0$. We may factorize $y = ny'$, where $n \geq 1$ is an integer
and $y' \in \Gamma \setminus \{0\}$ is irreducible (thus $y' \notin m \cdot \Gamma$ for any integer $m > 1$). Since $G$ is
torsion free, we thus have $y' \in B \cap \Gamma \setminus \{0\}$ and $\phi(y') = 0$.
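
The factorization $y = ny'$ is easy to carry out when the lattice is $\mathbb{Z}^d$ (which can always be arranged by a change of basis): one divides out the greatest common divisor of the coordinates. The Python sketch below is illustrative only; the function names and the sample homomorphism $\phi(n_1, n_2) = 2n_1 + 3n_2$ into $\mathbb{Z}$ are assumptions of this example.

```python
from functools import reduce
from math import gcd

def factor_primitive(y):
    """Write a non-zero integer vector y as n * y' with y' irreducible in Z^d."""
    n = reduce(gcd, (abs(c) for c in y))
    return n, tuple(c // n for c in y)

def phi(ns, v=(2, 3)):
    """Illustrative homomorphism Z^2 -> Z, (n1, n2) -> 2*n1 + 3*n2."""
    return sum(n * vi for n, vi in zip(ns, v))

y = (6, -4)                      # phi(y) = 12 - 12 = 0
n, y_prime = factor_primitive(y)
print(n, y_prime)                # prints: 2 (3, -2)
print(phi(y), phi(y_prime))      # prints: 0 0  (Z is torsion-free)
```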

Since $y'$ is irreducible, we can split $\Gamma = (\mathbb{Z} \cdot y') + \Gamma'$ where $\Gamma'$ is a lattice of rank at
most $d - 1$ (see e.g. [17, Corollary 3.5]). Applying an invertible linear transformation
to $\Gamma$ and $B$ (and the inverse linear transformation to $\phi$) we can normalize $y' = e_d$
and $\Gamma' \subseteq \mathbb{R}^{d-1}$, where $e_1, \ldots, e_d$ is the standard basis of $\mathbb{R}^d$ and $\mathbb{R}^{d-1}$ is the span
of $e_1, \ldots, e_{d-1}$. Let $\pi : \mathbb{R}^d \to \mathbb{R}^{d-1}$ be the standard projection; then we see that $\pi$
maps $\Gamma$ to $\Gamma' = \Gamma \cap \mathbb{R}^{d-1}$. Since $\phi(e_d) = 0$, we see that we can factor $\phi = \phi' \circ \pi$ for
some homomorphism $\phi' : \Gamma' \to G$ (in fact $\phi'$ is just the restriction of $\phi$ to $\Gamma'$). We
now set $Q$ to be the symmetric convex progression

    $Q := (\frac{1}{2} \cdot \pi(B), \Gamma', d - 1, \phi', \{0\})$.

Since

    $\phi((t \cdot B) \cap \Gamma) \subseteq \phi'((t \cdot \pi(B)) \cap \Gamma') = \phi'((2t \cdot \frac{1}{2} \cdot \pi(B)) \cap \Gamma')$

we obtain (9). The only remaining thing to prove is (10), or in other words that

    $\phi'((\frac{t}{2} \cdot \pi(B)) \cap \Gamma') \subseteq \phi((t \cdot B) \cap \Gamma)$.

Since $\phi = \phi' \circ \pi$, it suffices to show that

    $(\frac{t}{2} \cdot \pi(B)) \cap \Gamma' \subseteq \pi((t \cdot B) \cap \Gamma)$.

But if $z \in (\frac{t}{2} \cdot \pi(B)) \cap \Gamma'$, then $2z \in \pi(t \cdot B) \cap \Gamma'$, and so there exists $w \in \pi^{-1}(2z)$
such that $w \in t \cdot B$. But since $e_d = y' \in B$, and $B$ is symmetric, we also have
$t e_d, -t e_d \in t \cdot B$. We conclude from convexity that $\frac{w}{2} + t' e_d \in t \cdot B$ for all $-t/2 \leq t' \leq
t/2$. Since $\pi(w/2) = z \in \Gamma'$, $\Gamma = (\mathbb{Z} \cdot e_d) + \Gamma'$, and $t \geq 1$, we conclude that there
exists $-t/2 \leq t' \leq t/2$ such that $\frac{w}{2} + t' e_d \in (t \cdot B) \cap \Gamma$. Since $\pi(\frac{w}{2} + t' e_d) = z$, we
obtain the desired inclusion. ■

We can iterate the above lemma to obtain 

Corollary 4.2 (John's theorem for convex progressions). Let $P$ be a symmetric
convex progression of rank $d \geq 0$ in a torsion-free group $G$, and let $t \geq 1/2$. Then
there exists a $t$-proper symmetric convex progression $Q$ of rank $r$ for some $0 \leq r \leq d$,
such that we have the inclusions

    $\mathrm{Image}(P_{t'}) \subseteq \mathrm{Image}(Q_{2^{d-r+1} t t'})$

for all $t' > 0$ and

    $\mathrm{Image}(Q_{2tt'}) \subseteq \mathrm{Image}(P_{t'})$

for all $t' \geq 1$. Furthermore, if $P$ is not 1/2-proper, then $Q$ can be chosen to have
rank $r \leq d - 1$.

Proof. It suffices to prove the claim for $t = 1/2$, as the general case then follows
by replacing $Q$ by $Q_{1/(2t)}$. We now perform the following algorithm.

• Step 0. Initialize $r = d$ and $Q = P$.

• Step 1. If $Q_{1/2}$ is already proper then STOP.

• Step 2. Otherwise, use Lemma 4.1 to replace $Q$ by a symmetric convex
progression $Q'$ of rank $r - 1$ such that $\mathrm{Image}(Q'_t) \subseteq \mathrm{Image}(Q_t)$ for all $t \geq 1$ and
$\mathrm{Image}(Q_t) \subseteq \mathrm{Image}(Q'_{2t})$ for all $t > 0$.

• Step 3. Replace $r$ by $r - 1$ and return to Step 1.

This algorithm terminates with $r \geq 0$ since progressions of rank 0 are trivially
proper. The claims are then easily verified. ■
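
To spell out the verification in the case $t = 1/2$, suppose the algorithm performs $k = d - r$ rank reductions, producing progressions $Q^{(0)} = P, Q^{(1)}, \ldots, Q^{(k)}$ (this superscript notation is introduced here only for illustration). Applying (9) and (10) at each step, the inclusions telescope as in the following sketch:

```latex
\begin{align*}
\mathrm{Image}(P_{t'})
  &\subseteq \mathrm{Image}\bigl(Q^{(1)}_{2t'}\bigr)
   \subseteq \mathrm{Image}\bigl(Q^{(2)}_{4t'}\bigr)
   \subseteq \cdots
   \subseteq \mathrm{Image}\bigl(Q^{(k)}_{2^{k} t'}\bigr)
  && \text{for all } t' > 0, \\
\mathrm{Image}\bigl(Q^{(k)}_{t'}\bigr)
  &\subseteq \mathrm{Image}\bigl(Q^{(k-1)}_{t'}\bigr)
   \subseteq \cdots
   \subseteq \mathrm{Image}\bigl(Q^{(0)}_{t'}\bigr)
   = \mathrm{Image}(P_{t'})
  && \text{for all } t' \geq 1.
\end{align*}
```

With $Q = Q^{(k)}$ and $2^{k} = 2^{d-r+1} \cdot \frac{1}{2}$, these are the two inclusions of Corollary 4.2 at $t = 1/2$.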

Proof. [Proof of Theorem 1.10] It suffices to verify the claim for $t = 1$, since the
claim for larger $t$ then follows by replacing $Q$ by $Q_{1/t}$ (and using Lemma 3.3 to
recover the size bound for $Q$).

Since $P$ is a symmetric GAP, it is also a symmetric convex progression. We
can thus invoke Corollary 4.2 and find a convex 1/2-proper progression $P' =
(B', \Gamma', d', \phi', \{0\})$ of rank $d' \leq d$ with the inclusions

    $\mathrm{Image}(P_t) \subseteq \mathrm{Image}(P'_{2^d t})$ for all $t > 0$

and

    $\mathrm{Image}(P'_t) \subseteq \mathrm{Image}(P_t)$ for all $t \geq 1$.

Also, if $P$ is not 1/2-proper, then $d' \leq d - 1$.

By Lemma 3.3 we conclude

    $\mathrm{size}(P) \leq \mathrm{size}(P'_{2^d}) \leq 2^{d^2 + O(d)}\, \mathrm{size}(P'_{1/2})$.

Now, by Theorem 1.6 (applied to $\frac{1}{2} \cdot B'$ and $\Gamma'$) we can find a proper symmetric
GAP $\tilde{Q}$ in $\mathbb{R}^{d'}$ of rank at most $d'$ such that

    $(O(d)^{-3d/2} t \cdot B') \cap \Gamma' \subseteq \mathrm{Image}(\tilde{Q}_t) \subseteq (\frac{t}{2} \cdot B') \cap \Gamma'$

for all $t > 0$, and also

    $\mathrm{size}(\tilde{Q}) \geq O(d)^{-7d/2} |\frac{1}{2} \cdot B' \cap \Gamma'| = O(d)^{-7d/2}\, \mathrm{size}(P'_{1/2})$

and thus

    $\mathrm{size}(\tilde{Q}) \geq 2^{-d^2 - O(d \log d)}\, \mathrm{size}(P)$.

We can push forward the steps in $\tilde{Q}$ by $\phi'$ to create a symmetric GAP $Q$ in $G$ of
rank at most $d' \leq d$. Since $\tilde{Q}$ is proper and $\phi'$ is injective on $(\frac{1}{2} \cdot B') \cap \Gamma'$ we see
that $Q$ is also proper and has the same size as $\tilde{Q}$. Now for any $t \geq 1$ we have

    $\mathrm{Image}(\tilde{Q}_t) \subseteq (\frac{t}{2} \cdot B') \cap \Gamma' \subseteq (t \cdot B') \cap \Gamma'$

and thus applying $\phi'$ we have

    $\mathrm{Image}(Q_t) \subseteq \mathrm{Image}(P'_t) \subseteq \mathrm{Image}(P_t)$

as desired. Similarly, for any $t > 0$ we have

    $(2^d t \cdot B') \cap \Gamma' \subseteq \mathrm{Image}(\tilde{Q}_{O(d)^{3d/2} t})$

and thus on applying $\phi'$ we have

    $\mathrm{Image}(P_t) \subseteq \mathrm{Image}(P'_{2^d t}) \subseteq \mathrm{Image}(Q_{O(d)^{3d/2} t})$

as desired. The claims of Theorem 1.10 have now all been established. ■

Remark 4.3. Note that the bounds in Corollary 4.2 are slightly better than those
for Theorem 1.10 (the dilations are of the order of $2^{O(d)}$ rather than $d^{O(d)}$, although
the size bounds are comparable). This suggests that in applications it may be more
efficient to work with convex progressions instead of GAPs whenever possible. See
also [8] for further discussion.

5. The torsion case 

We now extend the arguments of the previous section to the torsion case to prove 
Theorem 1.17. The key ingredient is the following torsion variant of Lemma 4.1. 

Lemma 5.1 (Lack of properness implies rank reduction). Let $P$ be a symmetric 
convex coset progression of rank $d$ in a group $G$ (not necessarily torsion-free), 
and suppose that $P_{1/2}$ is not proper. Then there exists a symmetric convex coset 
progression $Q$ of rank $d - 1$ with the inclusions 

$$\mathrm{Image}(P_t) \subset \mathrm{Image}(Q_{2t}) \qquad (11)$$ 

for all $t > 0$ and 

$$\mathrm{Image}(Q_t) \subset \mathrm{Image}(P_t) \qquad (12)$$ 

for all $t \geq 1$. Furthermore, the symmetry group of $Q$ contains the symmetry group 
of $P$. 

Proof We follow closely the proof of Lemma 4.1. Write $P = (B, \Gamma, d, \phi, H)$. Since 
$P_{1/2}$ is not proper, there exist $x, x' \in (\frac{1}{2} \cdot B) \cap \Gamma$ and $h, h' \in H$ with $(x, h) \neq (x', h')$ 
such that $\phi(x) + h = \phi(x') + h'$. Note that this forces $x \neq x'$. Setting $y := x - x'$, 
we thus have $y \in B \cap \Gamma \backslash \{0\}$ and $\phi(y) \in H$. 

As before we can find $y \in B \cap \Gamma \backslash \{0\}$ with $\phi(y) \in H$, and can write $y = ny'$ 
where $y' \in B \cap \Gamma \backslash \{0\}$ is irreducible. Unfortunately, we can no longer conclude that 
$\phi(y') \in H$, only that $n\phi(y') \in H$. However, we can shrink $n$ (and thus $y$) and 
assume that $n$ is the minimal positive integer such that $n\phi(y') \in H$; note that we 
still have $ny' \in B \cap \Gamma \backslash \{0\}$. Let $H'$ be the group generated by $H$ and $\phi(y')$; this is 
then finite (indeed $|H'| = n|H|$). In particular, $\phi(y')$ has finite order. 

As before we can normalize $y' = e_d$ and split $\Gamma = (\mathbf{Z} \cdot e_d) + \Gamma'$ where $\Gamma' \subset \mathbf{R}^{d-1}$, 
and let $\pi : \mathbf{R}^d \to \mathbf{R}^{d-1}$ be the usual projection, thus $\pi$ maps $\Gamma$ to $\Gamma'$. If we define 
$\phi' : \Gamma' \to G$ to be the restriction of $\phi$ to $\Gamma'$, then it is no longer true that $\phi = \phi' \circ \pi$; 
instead, $\phi(x)$ and $\phi'(\pi(x))$ can differ by an element of the finite cyclic group $\langle \phi(e_d) \rangle$. 
We now set $Q$ to be the symmetric convex coset progression 

$$Q := (\tfrac{1}{2} \cdot \pi(B), \Gamma', d - 1, \phi', H').$$ 

Since 

$$\phi((t \cdot B) \cap \Gamma) + H \subset \phi'((t \cdot \pi(B)) \cap \Gamma') + \langle \phi(e_d) \rangle + H = \phi'((2t \cdot \tfrac{1}{2} \cdot \pi(B)) \cap \Gamma') + H'$$ 

we obtain (11). The only remaining thing to prove is (12), or in other words that 

$$\phi'((\tfrac{t}{2} \cdot \pi(B)) \cap \Gamma') + H' \subset \phi((t \cdot B) \cap \Gamma) + H.$$ 

Accordingly, let $z \in (\frac{t}{2} \cdot \pi(B)) \cap \Gamma'$ and $h' \in H'$; we need to show that 

$$\phi(z) + h' \in \phi((t \cdot B) \cap \Gamma) \mod H.$$ 

By definition of $H'$, we have $h' = l\phi(e_d) \mod H$ for some integer $l$. As in the proof 
of Lemma 4.1 we can find $w \in \pi^{-1}(2z)$ such that $w \in t \cdot B$. But $ne_d = y \in B$, and 
so by convexity as before we have $\frac{w}{2} + t'e_d \in t \cdot B$ for all $-tn/2 \leq t' \leq tn/2$. Since 
$\pi(w/2) = z$, we can express $w/2 = z + me_d$ for some integer $m$, thus 

$$z + (m + t')e_d \in (t \cdot B) \cap \Gamma \text{ for all } -tn/2 \leq t' \leq tn/2.$$ 

Since $t \geq 1$, we may select $t'$ such that $m + t' - l$ is a multiple of $n$. Since 
$n\phi(e_d) = 0 \mod H$, we conclude that 

$$\phi(z) + h' = \phi(z + (m + t')e_d) - (m + t' - l)\phi(e_d) = \phi(z + (m + t')e_d) \mod H$$ 

and the claim follows. ■ 
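
To make the rank reduction concrete, here is a minimal sketch under strong simplifying assumptions that are ours and not the paper's: the group is G = Z/qZ, the rank is d = 1, B = [-N, N], Gamma = Z, phi(n) = n*v mod q, and H = {0}. In this case the reduced progression Q has rank 0, and its image at every dilation is the cyclic group H' generated by v. All function names are hypothetical.

```python
from math import gcd, floor

def image_P(v, N, q, t):
    """Image(P_t) = { n*v mod q : n an integer with |n| <= t*N }."""
    bound = floor(t * N)
    return {(n * v) % q for n in range(-bound, bound + 1)}

def half_is_proper(v, N, q):
    """P_{1/2} is proper iff n -> n*v mod q is injective on |n| <= N/2."""
    bound = N // 2
    return len(image_P(v, N, q, 0.5)) == 2 * bound + 1

def reduced_image(v, q):
    """H' = <v>: the image of the rank-0 coset progression Q at any dilation."""
    order = q // gcd(v, q)          # minimal n >= 1 with n*v = 0 mod q
    return {(j * v) % q for j in range(order)}
```

With q = 12, v = 4 and N = 4 the progression P_{1/2} is not proper, H' = {0, 4, 8}, and one can check directly that Image(P_t) is contained in H' for all t > 0 while H' is contained in Image(P_t) for all t >= 1, in line with (11) and (12).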

We can then iterate the proof of Corollary 4.2 more or less verbatim to obtain 

Corollary 5.2 (John's theorem for convex coset progressions). Let $P$ be a symmetric 
convex coset progression of rank $d \geq 0$, and let $t \geq 1/2$. Then there exists a 
$t$-proper symmetric convex coset progression $Q$ of rank $r$ for some $0 \leq r \leq d$, such 
that we have the inclusions 

$$\mathrm{Image}(P_{t'}) \subset \mathrm{Image}(Q_{2^{d-r+1} t t'})$$ 

for all $t' > 0$ and 

$$\mathrm{Image}(Q_{2tt'}) \subset \mathrm{Image}(P_{t'})$$ 

for all $t' \geq 1$. Furthermore, the symmetry group of $Q$ contains that of $P$, and if $P$ 
is not $1/2$-proper, then $Q$ can be chosen to have rank $r \leq d - 1$. 

Theorem 1.17 then follows from Corollary 5.2 in exactly the same way that Theorem 
1.10 follows from Corollary 4.2 with only minor changes (e.g. replacing "GAP" with 
"coset progression" throughout, and tensor summing $\phi'$ with the identity map on 
$H$). We leave the details to the reader. 

6. Coalescence of progressions 

We now prove a variant of Theorem 1.21 for coset progressions, which will in fact 
play a crucial role in the proof of that theorem. 

Lemma 6.1 (Coalescence of coset progressions). Let $P$ be a symmetric coset progression 
of rank $d \geq 0$, and let $l \geq 1$ be an integer. Then there exists a proper symmetric 
coset progression $Q$ of rank $0 \leq d' \leq d$ such that $\mathrm{size}(Q) \geq d^{-3d^2/4} 2^{-C_0 d^2} l^{d'} \mathrm{size}(P)$ 
for some absolute constant $C_0 > 0$ and 

$$\mathrm{Image}(Q) \subset l\,\mathrm{Image}(P) \subset \lfloor (C_0 d)^{3d^2/4} \rfloor \mathrm{Image}(Q).$$ 

In particular $\mathrm{Image}(Q)$ generates the same group as $\mathrm{Image}(P)$. 

See [17, Lemma 12.6] for a one-sided variant of this result in the torsion-free setting; 
earlier results in this direction are in [16]. In fact our proof here is based on the 
proof of that lemma. 

Proof We induct on $d$. The case $d = 0$ is trivial (setting $Q := P$), so suppose that 
$d \geq 1$ and the claim has already been proven for all smaller values of $d$. We will fix 
$C_0$ to be a very large absolute constant to be chosen later. 

By shrinking the dimensions we may assume that all dimensions of $P$ are integers; 
in particular, $k\,\mathrm{Image}(P) = \mathrm{Image}(P_k)$ for all $k \geq 1$. We may also assume without 
loss of generality that $l$ is a power of two and $l \geq (Cd)^{3d/2}$, where $C$ is a large 
absolute constant to be chosen later. 
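
The identity between the k-fold sumset of Image(P) and Image(P_k) for integer dimensions can be checked numerically. The sketch below assumes the simplest case of a symmetric GAP in the integers (no finite group component) with integer steps, dimensions and dilation; the function names are hypothetical.

```python
from itertools import product

def image(steps, dims, t=1):
    """Image(P_t) for a symmetric GAP in Z: all n.v with |n_i| <= t * N_i."""
    ranges = [range(-t * N, t * N + 1) for N in dims]
    return {sum(n * v for n, v in zip(idx, steps)) for idx in product(*ranges)}

def iterated_sumset(S, k):
    """The k-fold sumset kS = S + ... + S (k summands)."""
    total = {0}
    for _ in range(k):
        total = {a + b for a in total for b in S}
    return total
```

The equality holds because every integer in [-kN, kN] is a sum of k integers from [-N, N]; the example P need not be proper for this to work.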

Let $l'$ be the greatest integer less than $l/(Cd)^{3d/2}$. Suppose first that $P$ is $l'$-proper. 
Then we see that 

$$|l'\,\mathrm{Image}(P)| = \mathrm{size}(P_{l'}) \geq O(Cd)^{-3d^2/2} l^d\,\mathrm{size}(P)$$ 

and the claim follows by taking $Q := P_{l'}$. Thus we may assume that $P$ is not 
$l'$-proper. Thus there exists $0 \leq k < \log_2 l'$ such that $P$ is $2^k$-proper but not 
$2^{k+1}$-proper. In particular, $P_{2^{k+2}}$ is not $1/2$-proper. Hence by Corollary 1.11 (with 
$t = 1$) we can find a proper symmetric coset progression $P'$ of rank $0 \leq d'' \leq d - 1$ 
such that 

$$\mathrm{Image}(P_{2^{k+2}}) \subset \mathrm{Image}(P') \subset \mathrm{Image}(P_{O(d)^{3d/2} 2^k}). \qquad (13)$$ 

In particular we see that $\mathrm{Image}(P')$ generates the same group as $\mathrm{Image}(P)$, and 
also 

$$\mathrm{size}(P') \geq \mathrm{size}(P_{2^k}) \geq 2^{dk - O(d)} \mathrm{size}(P). \qquad (14)$$ 

We then apply the induction hypothesis with $P$ replaced by $P'$ and $l$ replaced 
by $l'' = O(d)^{-3d/2} 2^{-k} l$ and conclude that there exists a proper symmetric coset 
progression $Q$ of rank $0 \leq d' \leq d''$ such that 

$$\mathrm{Image}(Q) \subset l''\,\mathrm{Image}(P') \subset \lfloor (C_0 (d - 1))^{3(d-1)^2/4} \rfloor \mathrm{Image}(Q) \qquad (15)$$ 

and 

$$\mathrm{size}(Q) \geq (d'')^{-3(d'')^2/4} 2^{-C_0 (d'')^2} (l'')^{d'} \mathrm{size}(P').$$ 

Since $d'' \leq d - 1$, we thus conclude (estimating $(d'')^{-3(d'')^2/4}$ from below by 
$d^{-3(d-1)^2/4}$ and using (14)) that (if $C_0$ is large enough) 

$$\mathrm{size}(Q) \geq d^{-3d^2/4} 2^{-C_0 d^2} l^{d'} \mathrm{size}(P).$$ 

Meanwhile, from (13), (15) we have (if $l''$ was chosen correctly, and $C_0$ is large 
enough) that 

$$\mathrm{Image}(Q) \subset l\,\mathrm{Image}(P)$$ 
$$\subset \lfloor (C_0 (d - 1))^{3(d-1)^2/4} \rfloor O(d)^{3d/2}\,\mathrm{Image}(Q)$$ 
$$\subset \lfloor (C_0 d)^{3d^2/4} \rfloor \mathrm{Image}(Q)$$ 

and the claim follows. ■ 



7. Sumsets in arbitrary groups 



We now prove Theorem 1.21. We broadly follow the strategy in [16], showing that 
iterated sumsets first contain large symmetric sets, then large coset progressions of 
high rank, then large coset progressions of low rank. 

We first translate $A$ so that $0 \in A$, so that the group generated by $A - A$ is nothing 
more than $\langle A \rangle$, the group generated by $A$. Also we see that the iterated sumsets $kA$ 
are nested in $k$. We shall also take $C_1$ to be a sufficiently large absolute constant 
to be chosen later. 



For any integer $0 \leq k \leq \log_2 l$ we have 

$$|2^k A| \leq |lA| \leq 2^{-2^{C_1 d^2 2^{6d}}} l^d |A|.$$ 

In particular this forces 

$$l \geq 2^{2^{C_1 d^2 2^{6d}}/d}. \qquad (16)$$ 

If we set $k'$ to be the first integer for which 

$$|2^{k'+1} A| < 2^d |2^{k'} A| \qquad (17)$$ 

we thus see (from the pigeonhole principle) that $k'$ exists and is less than $\log_2 l$, 
and furthermore since 

$$|2^{k'} A| \geq 2^{dk'} |A|$$ 

we have 

$$2^{dk'} \leq 2^{-2^{C_1 d^2 2^{6d}}} l^d$$ 

and thus in particular 

$$k' \leq \log_2 l - 2^{C_1 d^2 2^{6d}}/d. \qquad (18)$$ 
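
The selection of k' is a simple search over dyadic iterated sumsets, which can be sketched as follows. This is only an illustration for finite sets of integers; the function names and the cutoff parameter max_k are hypothetical and not part of the argument above.

```python
def sumset(S, T):
    """The sumset S + T of two finite sets of integers."""
    return {s + t for s in S for t in T}

def first_slow_doubling(A, d, max_k):
    """Smallest k in [0, max_k] with |2^(k+1) A| < 2^d |2^k A|, or None if none."""
    current = set(A)                          # holds 2^k A
    for k in range(max_k + 1):
        doubled = sumset(current, current)    # 2^(k+1) A
        if len(doubled) < 2 ** d * len(current):
            return k
        current = doubled
    return None
```

For every k below the returned value the sumset grows by a factor of at least 2^d at each doubling, which is exactly what gives the lower bound on |2^{k'} A| used to derive (18).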

We will shortly encounter a need to replace $2^{k'} A$ by a symmetric set. For this we 
need the following lemma (a corrected version of [17, Exercise 2.3.14]): 

Lemma 7.1 (Small doubling implies large symmetric sets). Let $A \subset G$ be finite 
with $|2A| \leq K|A|$ for some $K \geq 1$. Then there exists $F \subset A$ and $x \in G$ such that 
$F = x - F$ and $|F| \geq |A|/K$. 

Proof There are $|A|^2$ possible sums of the form $a + b$ with $a, b \in A$, which lie in a 
set $2A$ of cardinality at most $K|A|$. By the pigeonhole principle, we can thus find 
$x \in 2A$ which can be written as such a sum in at least $|A|/K$ ways. The claim then 
follows by setting $F := \{a \in A : x - a \in A\}$. ■ 
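
The pigeonhole argument in this proof is constructive and can be sketched directly. The code below assumes a finite set of integers and picks a most popular sum x; the function name is hypothetical.

```python
from collections import Counter

def large_symmetric_subset(A):
    """Return (x, F) with F = {a in A : x - a in A} for a most popular sum x."""
    A = set(A)
    # Count ordered representations x = a + b with a, b in A.
    counts = Counter(a + b for a in A for b in A)
    x, _ = max(counts.items(), key=lambda item: item[1])
    F = {a for a in A if x - a in A}
    return x, F
```

The size of F equals the number of ordered representations of x, so it is at least |A|^2 / |2A|, and F = x - F holds because a lies in F exactly when x - a does.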

Applying this to $2^{k'} A$ we can find $F \subset 2^{k'} A$ and $x$ such that $F = x - F$ and 

$$|F| \geq 2^{-d} |2^{k'} A| \geq 2^{dk' - O(d)} |A|.$$ 

In particular, 

$$|2F| \leq |2^{k'+1} A| \leq 2^{2d} |F|.$$ 

Applying Theorem 1.15 to $F$, we conclude that there exists a symmetric proper 
coset progression $P$ of rank $r = O(d 2^{6d})$ and size 

$$\mathrm{size}(P) \geq 2^{-O(d^2 2^{6d})} |F| \geq 2^{-O(d^2 2^{6d})} |2^{k'} A| \geq 2^{dk' - O(d^2 2^{6d})} |A| \qquad (19)$$ 

such that 

$$\mathrm{Image}(P) \subset 2F - 2F.$$ 

Since $F = x - F$, we have 

$$2F - 2F = 4F - 2x \subset 2^{k'+2} A - 2x$$ 

and hence 

$$2x + \mathrm{Image}(P) \subset 2^{k'+2} A.$$ 

In particular 

$$|2^{k'+2} A + \mathrm{Image}(P)| \leq |2^{k'+3} A|.$$ 

But from (17) and Plünnecke estimates (see [17, Section 6.5]) we have 

$$|2^{k'+3} A| \leq 2^{O(d)} |2^{k'+2} A|.$$ 

Using the Ruzsa covering lemma as in Lemma 3.3, we thus see that we can cover 
$2^{k'+2} A$ by up to $2^{O(d^2 2^{6d})}$ translates of $\mathrm{Image}(P)$. Thus we may write 

$$2^{k'+2} A \subset \bigcup_{i=1}^{m} (2x + a_i + \mathrm{Image}(P))$$ 

for some $a_1, \ldots, a_m$ with 

$$m = 2^{O(d^2 2^{6d})}, \qquad (20)$$ 

which we may take to lie in 

$$a_i \in 2^{k'+2} A - (2x + \mathrm{Image}(P)) \subset 2^{k'+2} A - 2^{k'+2} A.$$ 

Thus we can write $a_i = b_i - c_i$ for some $b_i, c_i \in 2^{k'+2} A$. 
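
The covering step rests on a greedy selection of disjoint translates, which can be sketched as follows. This is the standard form of the Ruzsa covering argument for finite sets of integers, stated for translates of B - B; it is not claimed to be the exact variant used in Lemma 3.3, and the function name is hypothetical.

```python
def ruzsa_cover(S, B):
    """Greedily choose centers x in S so that the translates x + B are pairwise disjoint.

    By maximality, S is contained in X + B - B, and since the chosen translates
    are disjoint subsets of S + B we get |X| <= |S + B| / |B|.
    """
    B = set(B)
    centers, used = [], set()
    for s in sorted(S):
        translate = {s + b for b in B}
        if used.isdisjoint(translate):
            centers.append(s)
            used |= translate
    return centers
```

When B is the image of a symmetric progression, B - B is the image of its dilate by 2, so a cover by translates of B - B is a cover by translates of a comparable progression.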

If $P = (N, v, r, H)$, we let $P'$ be the larger coset progression 

$$P' := (N \oplus (1, \ldots, 1), v \oplus (a_1, \ldots, a_m), r + m, H)$$ 

formed by adjoining $m$ new steps $a_1, \ldots, a_m$ with dimensions one each. Then we 
have 

$$\mathrm{size}(P') \geq \mathrm{size}(P) \geq 2^{dk' - O(d^2 2^{6d})} |A| \qquad (21)$$ 

and 

$$\mathrm{rank}(P') = r + m \leq 2^{O(d^2 2^{6d})}. \qquad (22)$$ 

Also, by construction we have 

$$2^{k'+2} A \subset \bigcup_{i=1}^{m} (2x + a_i + \mathrm{Image}(P)) \subset \mathrm{Image}(P'). \qquad (23)$$ 

Furthermore, we observe that 

$$\mathrm{Image}(P') \subset \mathrm{Image}(P) + [-1, 1]_{\mathbf{Z}} \cdot (b_1 - c_1) + \ldots + [-1, 1]_{\mathbf{Z}} \cdot (b_m - c_m).$$ 

Since $b_i, c_i \in 2^{k'+2} A$, we have $[-1, 1]_{\mathbf{Z}} \cdot (b_i - c_i) \subset -b_i - c_i + 2^{k'+3} A$ and thus 

$$x_1 + \mathrm{Image}(P') \subset (m + 1) 2^{k'+3} A$$ 

where $x_1 := 2x + \sum_{i=1}^{m} (b_i + c_i)$. 
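
The containment of the three multiples of b - c in a translate of a doubled set follows from the identities (b + c) - (b - c) = 2c, (b + c) + 0 = b + c and (b + c) + (b - c) = 2b. A minimal numerical check, for a finite set S of integers playing the role of the iterated sumset (the function name is hypothetical):

```python
def unit_multiples_fit(S):
    """Check that j*(b - c) + (b + c) lies in S + S for all b, c in S and j in {-1, 0, 1}."""
    two_S = {s + t for s in S for t in S}
    return all(j * (b - c) + b + c in two_S
               for b in S for c in S for j in (-1, 0, 1))
```

The check succeeds for every finite S, since each of 2c, b + c and 2b is a sum of two elements of S.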

Let $l'$ be the largest integer such that $l'(m + 1) 2^{k'+3} \leq l$; note from (16), (18) that 
$l'$ is at least 1 (if $C_1$ is large enough). Applying Lemma 6.1 we can thus find a 
symmetric coset progression $Q$ of rank $0 \leq d' \leq r + m$ such that 

$$\mathrm{size}(Q) \geq (r + m)^{-O((r+m)^2)} (l')^{d'} \mathrm{size}(P') \geq 2^{-2^{O(d^2 2^{6d})}} l^{d'} |A| \qquad (24)$$ 

(using (20), (21), (22)) and 

$$\mathrm{Image}(Q) \subset l'\,\mathrm{Image}(P') \subset O((r + m)^{O((r+m)^2)})\,\mathrm{Image}(Q) \subset 2^{2^{O(d^2 2^{6d})}} \mathrm{Image}(Q). \qquad (25)$$ 

On the other hand, we have 

$$l'\,\mathrm{Image}(P') \subset -l' x_1 + l'(m + 1) 2^{k'+3} A \subset -l' x_1 + lA$$ 

and so $lA$ contains a translate of $\mathrm{Image}(Q)$. On the other hand, from (23), (25) we 
have 

$$l' 2^{k'+2} A \subset 2^{2^{O(d^2 2^{6d})}} \mathrm{Image}(Q)$$ 

and hence (by definition of $l'$) 

$$lA \subset O(m 2^{2^{O(d^2 2^{6d})}})\,\mathrm{Image}(Q) \subset 2^{2^{O(d^2 2^{6d})}} \mathrm{Image}(Q).$$ 

We now have the required size and containment bounds on $Q$. The only problem 
is that we have only a very poor bound on $d'$. We can improve it by noting that 

$$\mathrm{size}(Q) \leq |lA| \leq 2^{-2^{C_1 d^2 2^{6d}}} l^d |A|$$ 

and hence by (24) we have $d' \leq d - 1$ if $C_1$ is sufficiently large (compared to $d$ and 
the hidden constant in the $O$ of (24)). The proof of Theorem 1.21 is now complete. 